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<210>INFORMATION FOR SEQ ID N0:1 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1610 base pairs 

(B) TYPE: nudeic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: DNA coding sequence for HIV capsid protein Pr55 

<400> SEQUENCE DESCRIPTION: SEQ ID N0:1 



ATGGGTGCGA 


GAGCGTCGGT 


ATTAAGCGGG 


GGAGAATTAG 


ATAAATGGGA 


AAAAATTCGG 


60 


TTAAGGCCAG 


GGGGAAAGAA 


AAAATATAAG 


TTAAAACATA 


TAGTATGGfeC 


AAGCAGGGAG 


120 


CTAGAACGAT 


TCGCAGTCAA 


TCCTGGCCTG 


TTAGAAACAT 


CAGAAGGCTG 


CAGACAAATA 


180 


TTGGGACAGC 


TACAGCCATC 


CCTTCAGACA 


GGATCAGAAG 


AACTTAGATC 


ATTATATAAT 


240 


ACAGTAGCAA 


CCCTCTATTG 


TGTACATCAA 


AGGATAGATG 


TAAAAGACAC 


CAAGGAAGCT 


300 


TTAGAGAAGA 


TAGAGGAAGA 


GCAAAACAAA 


AGTAAGAAAA 


AGGCACAGCA 


AGCAGCAGCT 


360 


GCAGCTGGCA 


CAGGAAACAG 


CAGCCAGGTC 


AGCCAAAATT 


ACCCTATAGT 


GCAGAACCTA 


420 


CAGGGGCAAA 


TGGTACATCA 


GGCCATATCA 


CCTAGAACTT 


TAAATGCATG 


GGTAAAAGTA 


480 


GTAGAAGAAA 


AGGCTTTCAG 


CCCAGAAGTA 


ATACCCATGT 


TTTCAGCATT 


ATCAGAAGGA 


540 


GCCACCCCAC 


AAGATTTAAA 


CACCATGCTA 


AACACAGTGG 


GGGGACATCA 


AGCAGCCATG 


600 


CAAATGTTAA 


AAGAGACTAT 


CAATGAGGAA 


GCTGCAGAAT 


GGGATAGAGT 


GCATCCAGTG 


660 


CATGCAGGGC 


CTATTGCACC 


AGGCCAAATG 


AGAGAACCAA 


GGGGAAGTGA 


CATAGCAGGA 


720 


ACTACTAGTA 


CCCTTCAGGA 


ACAAATAGGA 


TGGATGACAA 


ATAATCCACC 


TATCCCAGTA 


780 


GGAGAAATCT 


ATAAAAGATG 


GATAATCCTG 


GGATTAAATA 


AAATAGTAAG 


AATGTATAGC 


840 


CCTACCAGCA 


TTCTGGACAT 


AAGACAAGGA 


CCAAAGGAAC 


CCTTTAGAGA 


TTATGTAGAC 


900 


CGGTTCTATA 


T^CTCTAAG 


AGCCGAACAA 


GCTTCACAGG 


ATGTAAAAAA 


TTGGATGACA 


960 


GAAACCTTGT 


TGGTCCAAAA 


TGCAAACCCA 


GATTGTAAGA 


CTATTTTATU^ 


AGCATTGGGA 


1020 


CCAGCAGCTA 


CACTAGAAGA 


AATGATGACA 


GCATGTCAGG 


GAGTGGGGGG 


ACCCGGCCAT 


1080 


AAAGCAAGAG 


TTTTGGCTGA 


AGCCATGAGC 


CAAGTAACAA 


ATCCAGCTAA 


CATAATGATG 


1140 


CAGAGAGGCA 


ATTTTAGGAA 


CCAAAGAAAG 


ACTGTTAAGT 


GTTTCAATTG 


TGGCAAAGAA 


1200 



. (C) ISOLATE: Degenerate oligonucleotide C- terminal peptide 
sequence of WGHP68 

<400> SEQUENCE DESCRIPTION: SEQ ID NO: 3 
ATGAATTCAC TGGGACTGCG GATAGATTAC TGGTACTGGG GATC 44 
<210> INFORMATION FOR SEQ ID NO: 4 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(vi) ORIGINAL SOURCE: 

(C) ISOLATE: Degenerate oligonucleotide C- terminal peptide 
sequence of WGHP68 

<400> SEQUENCE DESCRIPTION: SEQ ID NO: 4 
ATGAATTCAC TGGGCTCTGA TAGATTACTG GTACTGGGGA TC 42 



<210> SEQ ID NO: 5 
<211> Length: 604 
<212> Type: PRT 

<213> Organism: Triricum aestivum 
<400> Sequence 5 

Met Ala Asp Arg Leu Thr Arg lie Ala Ila Val Ser Glu Asp Lys Cys 
15 10 15 

Lys Pro Lys Lys Cys Arg Gin Glu Cys Lys Lys Ser Cys Pro Val Val 

20 25 30 

Lys Thr Gly Lys Leu Cys lie Glu Val Ser Pro Val Ala Lys Leu Ala 

35 40 45 

Phe lie Ser Glu Glu Leu Cys lie Gly Cys Gly lie Cys Val Lys Lys 



50 , 55 60 

Cys Pro Phe Asp Ala He Glu He He Asn Lau Pro Lys Asp Leu Glu 
65 70 75 80 

Lys Asp Thr Thr H.\s Arg Tyr Gly Pro Asn Thr Phe Lys Leu His Arg 

85 90 95 

Leu Pro Val Pro Arg Pro Gly Gin Val Leu Gly Leu Val Gly Thr Asn 

100 105 110 

Gly He Gly Lys Ser Thr Ala Leu Lys Val Leu Ala Gly Lys Leu Lys 

lis 120 125 

Pro Asn Leu Gly Arg Phe Lys Asn Pro Pro Asp Trp Gin Glu He Leu 

130 135 140 

Thr Tyr Phe Arg Gly Ser Glu Leu Gin Asn Tyr Phe Thr Arg He Leu 
145 150 155 160 

Glu Asp Asn Leu Lys Ala He He Lys Pro Gin Tyr Val Asp His He 

165 170 175 

Pro Lys Ala Val Gin Gly Asn Val Gly Gin Val Leu Glu Gin Lys Asp 

180 185 190 

Glu Arg Asp Met Lys Asn Glu Leu Cys Val Asp Leu Glu Leu Asn Gin 

195 200 205 

Val He Asp Arg Asn Val Gly Asp Leu Ser Gly Gly Glu Leu Gin Arg 

210 215 220 

Phe Ala He Ala Val Val Ala Val Gin Ser Ala Glu He Tyr Met Phe 
225 230 235 240 

Asp Glu Pro Ser Ser Tyr Leu Asp Val Lys Gin Arg Leu Lys Ala Ala 

245 250 255 

Arg Val He Arg Ser Leu Leu Arg Ser Asn Ser Tyr Val He Val Val 

260 265 270 

Glu His Asp Leu Ser Val Leu Asp Tyr Leu Ser Asp Phe He Cys Cys 

275 280 285 

Leu Tyr Gly Lys Pro Gly Ala Tyr Gly Val Val Thr Leu Pro Phe Ser 

290 295 300 

Val Arg Glu Gly He Asn He Phe Leu Ala Gly Phe Val Pro Thr Glu 
305 310 315 320 

Asn Leu Arg Phe Arg Asp Glu Ser Leu Thr Phe Lys He Ala Glu Thr 

325 330 335 

Gin Glu Ser Ala Glu Glu Val Ala Thr Tyr Gin Arg Tyr Lys Tyr Pro 

340 345 350 

Thr Met Ser Lys Thr Gin Gly Asn Phe Lys Leu Ser Val Val Glu Gly 



. 355 360 365 

Glu Phe Thr Asp Ser Gin He Val Val Met l3U Gly Glu Asn Gly Thr 

370 375 380 

Gly Lys Thr Thr Phe He Arg Met Leu Ala Gly Leu Leu Lys Pro Asp 
385 390 395 400 

Thr Met Glu Gly Thr Glu Val Glu He Pro Glu Phe Asn Val Ser Tyr 

405 410 415 

Lys Pro Gin Lys He Ser Pro Lys Phe Gin His Pro Val Arg His Leu 

420 425 430 

Leu His Ser Lys He Arg Asp Ser Tyr Thr His Pro Gin Phe Val Ser 

435 440 445 

Asp Val Met Lys Pro Leu Gin He Glu Gin Leu Met Asp Gin Glu Val 

450 455 460 

He Asn Leu Ser Gly Gly Glu Leu Gin Arg Val Ala Leu Cys Leu Cys 
465 470 475 480 

Leu Gly Lys Pro Ala Asp He Tyr Leu He Asp Glu Pro Ser Ala Tyr 

485 490 495 

Leu Asp Ser Glu Gin Arg He Val Ala Ser Lys Val He Lys Arg Phe 

500 505 510 

He Leu His Ala Lys Lys Thr Ala Phe He Val Glu His Asp Phe He 

515 520 ' 525 

Met Ala Thr Tyr Leu Ala Asp Lys Val He Val Tyr Glu Gly Leu Ala 

530 535 540 

Ser He Asp Cys Thr Ala Asn Ala Pro Gin Ser Leu Val Ser Gly Met 
545 550 555 560 

Asn Lys Phe Leu Ser Kis Leu Asp He Thr Phe Arg Arg Asp Pro Thr 

565 570 575 

Asn Tyr Arg Pro Arg He Asn Lys Leu Glu Ser Thr Lys Asp Arg Glu 

580 585 590 

Gin Lys Asn Ala Gly Ser Tyr Tyr Tyr Leu Asp Asp 
595 600 



<210> SEQ ID N0:6 

<211> LENGTH: 599 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapians 

<400> SEQUENCE 6 



Met Ala- Asp Lys Leu Thr Arg lie Ala lie Val Asn His Asp Lys Cys 
15 10 15 

Lys Pro Lys Lys Cys Arg Gin Glu Cys Lys Lys Ser Cys Pro Val Val 

20 25 30 

Arg Met Gly Lys Leu Cys lie Glu Val Thr Pro Gin Ser Lys lie Ala 

35 40 45 

Trp lie Ser Glu Thr Leu Cys lie Gly Cys Gly lie Cys lie Lys Lys 

50 55 60 

Cys Pro Phe Gly Ala Leu Ser lie Val Asn Leu Pro Ser Asn Leu Glu 
65 70 75 80 

Lys Glu Thr Thr His Arg Tyr Cys Ala Asn Ala Phe Lys Leu His Arg 

85 90 95 

Leu Pro lie Pro Arg Pro Gly Glu Val Leu Gly Leu Val Gly Thr Asn 

100 105 110 

Gly lie Gly Lys Ser Ala Ala Leu Lys lie Leu Ala Gly Lys Gin Lys 

115 120 125 

Pro Asn Leu Gly Lys Tyr Asp Asp Pro Pro Asp Trp Gin Glu lie Leu 

130 135 140 

Thr Tyr Phe Arg Gly Ser Glu Leu Gin Asn Tyr Phe Thr Lys lie Leu 
145 150 155 160 

Glu Asp Asp Leu Lys Ala lie lie Lys Pro Gin Tyr Val Ala Arg Phe 

165 170 175 

Leu Arg Leu Ala Lys Gly Thr Val Gly Ser lie Leu Asp Arg Lys Asp 

180 185 190 

Glu Thr Lys Thr Gin Ala lie Val Cys Gin Gin Leu Asp Leu Thr His 

195 200 205 

Leu Lys Glu Arg Asn Val Glu Asp Leu Ser Gly Gly Glu Leu Gin Arg 

210 215 220 

Phe Ala Cys Ala Val Val Cys lie Gin Lys Ala Asp lie Phe Met Phe 
225 230 235 240 

Asp Glu Pro Ser Ser Tyr Leu Asp Val Lys Gin Arg Leu Lys Ala Ala 

245 250 255 

lie Thr lie Arg Ser Leu lie Asn Pro Asp Arg Tyr lie lie Val Val 

260 265 270 

Glu His Asp Leu Ser Val Leu Asp Tyr Leu Ser Asp Phe lie Cys Cys 

275 280 285 

Leu Tyr Gly Val Pro Ser Ala Tyr Gly Val Val Thr Met Pro Phe Ser 
290 295 300 



# 



Val Arg. Glu Gly lie Asn lie Phe Leu Asp Gly Tyr Val Pro Thr Glu 
305 310 215 320 

Asn Leu Arg Phe Arg Asp Ala Ser Leu Val Phe Lys Val Ala Glu Thr 

325 330 335 

Ala Asn Glu Glu Glu Val Lys Lys Met Cys Met Tyr Lys Tyr Pro Gly 

340 345 350 

Met Lys Lys Lys Met Gly Glu Phe Glu Leu Ala lie Val Ala Gly Glu 

355 360 365 

Phe Thr Asp Ser Glu lie Met Val Met Leu Gly Glu Asn Gly Thr Gly 

370 375 380 

Lys Thr Thr Phe lie Arg Met Leu Ala Gly Arg Leu Lys Pro Asp Glu 
385 390 395 400 

Gly Gly Glu Val Pro Val Leu Asn Val Ser Tyr Lys Pro Gin Lys lie 

405 410 415 

Ser Pro Lys Ser Thr Gly Ser Val Arg Gin Leu Leu His Glu Lys lie 

420 425 430 

Arg Asp Ala Tyr Thr His Pro Gin Phe Val Thr Asp Val Met Lys Pro 

435 440 445 

Leu Gin lie Glu Asn lie lie Asp Gin Glu Val Gin Thr Leu Ser Gly 

450 455 460 

Gly Glu Leu Gin Arg Val Arg Leu Arg Leu Cys Leu Gly Lys Pro Ala 
465 470 475 480 

Asp Val Tyr Leu lie Asp Glu Pro Ser Ala Tyr Leu Asp Ser Glu Gin 

485 490 495 

Arg Leu Met Ala Ala Arg Val Val Lys Arg Phe lie Leu His Ala Lys 

500 505 510 

Lys Thr Ala Phe Val Val Glu His Asp Phe lie Met Ala Thr Tyr Leu 

515 520 525 

Ala Asp Arg Val lie Val Phe Asp Gly Val Pro Ser Lys Asn Thr Val 

530 535 540 

Ala Asn Ser Pro Gin Thr Leu Leu Ala Gly Met Asn Lys Phe Leu Ser 
545 550 555 560 

Gin Leu Glu lie Thr Phe Arg Arg Asp Pro Asn Asn Tyr Arg Pro Arg 

565 570 575 

lie Asn Lys Leu Asn Ser lie Lys Asp Val Glu Gin Lys Lys Ser Gly 

580 585 590 

Asn Tyr Phe Phe Leu Asp Asp 

595 



